For this final project, I use a data set called Global internet User. It is a data set that show us how many users are using the internet throughout 40 years. The data contain, country, country code, cellular subscription, internet user in number and in percentage. The research question that I have for this data set is what is the internet user number look like through out the year and how much different it is in different continent. Also, how much percentage of people have access of the internet.

In this linear graph, it is showing the sum of all internet user in each year. As year passed, the internet user start to grow higher and higher but as it is closer to recent year the slope of the growth is steeper. Which show no sign of going down at all as year progress. The area that is showing steep linear line is in 2019 and 2020. It is the time that COVID-19 happen and people were in quarantine and use more internet.

In this graph, I used multiple layer like linear line to show the average and scatter plot to show the data for each country. I use the filter to show the top 7 country that have the most access to internet. The y axis is telling us the number of subscription people have in 100 people. It is over 100 because one person can have more than one subscription. The x axis is the percentage of the population. This graph suggesting that in Hong Kong, most of the population have access to the internet and one person have multiple cellular description.

In this graph is also a line graph but show all the data for every country and is facet wrap by continent. So that the line will not be too overlapping. Because of Asia continent, it makes other continent look like they are few internet user. I seperated the graph to see the other graph line better. It seems like the number of internet user is correlate with the number of population. China have the most population so they have larger number of internet user.

This is the global map of the internet user in 2020. As you can see there are some country that is missing. It is because the value of the country in two of the data set I am using is not the same. So, when I left joined them together, they do not recognize it. When I see that graph, it make me feel like other country barely have much internet users. It would be interest to see, if I change the filling to the percentage of the population that have access to the internet. We might see more purple and red.